Classification Ratemaking Using Decision Trees

نویسنده

  • Nasser Hadidi
چکیده

Introduction Manual rating of specific risks begin with a base rate, which is then modified by appropriate relativity factors depending on characteristics of each risk. Classical methods of deriving indicated relativities, are described by McClenahan (1996) and Finger (1996). A number of different modeling procedures are described in Brown's (1988) "minimum bias" paper and Venter's (1990) review of Brown's paper. These methods generally rely on the "multiplicative" or "additive" assumptions, which may not be reasonable for all types of risk. In this paper an alternative method of calculating indicated relativities is described, and demonstrated using a commercial Business Owners' Product (BOP) data set. relativities are compared with observed relativities, thereby demonstrating the extent of suitability of this method. It should be stressed that the intent here is entirely demonstration of a procedure. For actual practical implementation, modification would be required. First a few words about the ten-ninology and the data set. Relativities are based on grouping of risks with similar risk characteristics. This is essentially a classification problem. The purpose of any classification procedure is partitioning of objects-in our case risks-into demonstrably more homogeneous groups. For the BOP data we seek groups of risks with significantly differing claim frequencies, severities, pure premiums or loss ratios. Typically partitioning is based on a number of risk factors, which for BOP Distinctions must be made between these rating factors, which are used to group risks together and variables such as frequency, severity, pure premium or loss ratio, which 254 must be estimated partially based on these risk factors. The former are independent or predictor variables while the latter are dependent variables. Both of these variables may be assumed to be either categorical or vary continuously in a given interval. The statistics used as the basis of classification depends on whether the dependent and/or the independent variables are categorical or interval scale. For risk classification the independent variables are typically categorical. For example the BOP data includes losses for three different coverages, in 51 different risk states, and with 178 different ISO territory codes, 6 different ISO construction codes, etc. It is customary to say that the classification variable-risk state-has 51 different levels. Similarly, there axe six levels of ISO construction code, etc. The BOP data set under consideration includes 27,854 claims with accident years 1997 through 2000, and 9011 claims with accident year 2001. These are broken down by: Number of Levels Risk …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting The Type of Malaria Using Classification and Regression Decision Trees

Predicting The Type of Malaria Using Classification and Regression Decision Trees Maryam Ashoori1 *, Fatemeh Hamzavi2 1School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran 2School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran Abstract Background: Malaria is an infectious disease infecting 200 - 300 million people annually. Environme...

متن کامل

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

Statistical Learning Algorithms Applied to Automobile Insurance Ratemaking

We recently conducted a research project for a large North American automobile insurer. This study was the most exhaustive ever undertaken by this particular insurer and lasted over an entire year. We analyzed the discriminating power of each variable used for ratemaking. We analyzed the performance of several models within five broad categories: linear regressions, generalized linear models, d...

متن کامل

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...

متن کامل

A Systematic Relationship between Minimum Bias and Generalized Linear Models

The minimum bias method is a natural tool to use in parameterizing classification ratemaking plans. Such plans build rates for a large, heterogeneous group of insureds using arithmetic operations to combine a small set of parameters in many different ways. Since the arithmetic structure of a class plan is usually not wholly appropriate, rates for some individual classification cells may be bias...

متن کامل

Application of Different Methods of Decision Tree Algorithm for Mapping Rangeland Using Satellite Imagery (Case Study: Doviraj Catchment in Ilam Province)

Using satellite imagery for the study of Earth's resources is attended by manyresearchers. In fact, the various phenomena have different spectral response inelectromagnetic radiation. One major application of satellite data is the classification ofland cover. In recent years, a number of classification algorithms have been developed forclassification of remote sensing data. One of the most nota...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003